Continuous Microphone Array Speech Recognition on Wall Street Journal Corpus

نویسندگان

  • Hari Krishna Maganti
  • Jithendra Vepa
  • Hervé Bourlard
چکیده

In this paper, we present a robust speech acquisition system to acquire continuous speech using a microphone array. A microphone array based speech recognition system is also presented to study the environmental interference due to reverberation, background noises and mismatch between the training and testing conditions. This is important in the context of smart meeting rooms of Augmented MultiParty Interaction (AMI) project which aims at significant development of conversational speech recognition. In this regard, an audio-visual database containing the Wall Street journal phrases was recorded in a real meeting room for the stationary speaker, moving speaker and overlapping speech scenarios. We carried out speech enhancement and continuous speech recognition experiments on stationary speaker data. Using a microphone array with beamformer followed by a postfilter enhances speech quality slightly inferior to that of close-talk headset,and better than lapel. We achieved a significant reduction in word error rates using models adapted based on maximum linear likelihood regression (MLLR) and maximum-a-posteriori (MAP) approaches. Though the error rates of the microphone array data are larger than those of headset data, they are significantly smaller compared to the error rates of lapel data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Training of isolated word recognizers with continuous speech

Is it possible to use out-of-domain acoustic training data to improve a speech recognizer's performance on a speci c, independent application? In our experiments, we use Wallstreet Journal (WSJ) data to train a recognizer, which is adapted and evaluated in the Phonebook domain. Apart from their common language (US English), the two corpora di er in many important respects: microphone vs. teleph...

متن کامل

Pronunciation Modelling in the Rwth Large Vocabulary Speech Recognizer

In this paper we describe the application of pronunciation variants for our large vocabulary continuous speech recognizer. We will explain how the pronunciation variants were used in training and recognition and give some recognition results on three different corpora. The recognition tests were performed on the Wall Street Journal (WSJ) November 92 development and evaluation corpora (5 000 wor...

متن کامل

Experiments on Recognition of Lavali Whispered Speech in Real Wo

In this paper, we present corpora and recognition experiments of the speech recorded in everyday life for the real world speech recognition. A speech corpus of 8,600 sentences from 53 speakers recorded through lavalier microphones in four different environments is built. The data was collected in an office space, a sound-proof room, in cars of different sizes, and on the street. Another corpus ...

متن کامل

Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data

In this contribution we investigate the effectiveness of BAYESIAN feature enhancement (BFE) on a medium-sized recognition task containing real-world recordings of noisy reverberant speech. BFE employs a very coarse model of the acoustic impulse response (AIR) from the source to the microphone, which has been shown to be effective if the speech to be recognized has been generated by artificially...

متن کامل

Issues in Large Vocabulary, Multilingual Speech Recognition

In this paper we report on our activities in multilingual, speaker-independent,large vocabulary continuous speech recognition. The multilingual aspect of this work is of particular importance in Eu-rope, where each country has its own national language. Our existing recognizer for American English and French, has been ported to British English and German. It has been assessed in the context of ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005